XDoc is a unified pre-trained model that processes documents in different formats. With only 36.7% of the parameters of the individual format-specific pre-trained models, XDoc achieves comparable or better performance on downstream tasks, which makes it cost-effective for practical deployment.
Large Language Model
Transformers